Decentralized multi-agent reinforcement learning with networked agents: recent advances
نویسندگان
چکیده
Multi-agent reinforcement learning (MARL) has long been a significant research topic in both machine and control systems. Recent development of (single-agent) deep created resurgence interest developing new MARL algorithms, especially those founded on theoretical analysis. In this paper, we review recent advances sub-area topic: decentralized with networked agents. scenario, multiple agents perform sequential decision-making common environment, without the coordination any central controller, while being allowed to exchange information their neighbors over communication network. Such setting finds broad applications operation robots, unmanned vehicles, mobile sensor networks, smart grid. This covers several our endeavors direction, as well progress made by other researchers along line. We hope that promotes additional efforts exciting yet challenging area.
منابع مشابه
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
We consider the problem of fully decentralized multi-agent reinforcement learning (MARL), where the agents are located at the nodes of a time-varying communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the informatio...
متن کاملCoordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND-POMDP) provides a framework to model such cooperative multi-agent decision making. Existing work on ND-POMDPs has focused on offline techniques that require accurate models, which are usually costly to obtain in pract...
متن کاملDynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning
In reinforcement learning, agents learn by performing actions and observing their 1 outcomes. Sometimes, it is desirable for a human operator to interrupt an agent 2 in order to prevent dangerous situations from happening. Yet, as part of their 3 learning process, agents may link these interruptions, that impact their reward, to 4 specific states and deliberately avoid them. The situation is pa...
متن کاملDecentralized multi-agent reinforcement learning in average-reward dynamic DCOPs
Researchers have introduced the Dynamic Distributed Constraint Optimization Problem (Dynamic DCOP) formulation to model dynamically changing multi-agent coordination problems, where a dynamic DCOP is a sequence of (static canonical) DCOPs, each partially different from the DCOP preceding it. Existing work typically assumes that the problem in each time step is decoupled from the problems in oth...
متن کاملMulti-Agent Reinforcement Learning
This thesis presents a novel approach to provide adaptive mechanisms to detect and categorise Flooding-Base DoS (FBDoS) and Flooding-Base DDoS (FBDDoS) attacks. These attacks are generally based on a flood of packets with the intention of overfilling key resources of the target, and today the attacks have the capability to disrupt networks of almost any size. To address this problem we propose ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Frontiers of Informaion Technology & Electronic Engineering
سال: 2021
ISSN: ['2095-9184', '2095-9230']
DOI: https://doi.org/10.1631/fitee.1900661